Genetic Algorithms and Cross-correlation Clustering of Time Series
نویسنده
چکیده
It may turn of interest the problem of finding a proper partition of a set of time series into clusters, where all time series that belong to the same cluster are significantly correlated each other. An obvious difficulty arises because the cross-correlation between each pair of time series is a function of the time lag. Several dissimilarity indexes which take into account the cross-correlation function are considered and compared in order to point out their relative merits and effectiveness in recovering the true cluster structure. A suitable internal criterion for evaluating the computed partition is presented, and its maximisation by means of a genetic algorithm is proposed. A simulation experiment is performed that may suggest what dissimilarity measure is best and what parameters had better selected for implementing the genetic algorithm. Comparison with the well-known and widely used single linkage method, and with the artificial neural network procedure which introduced into practice the clustering criterion that, with some modifications, is adopted here, is carried out as well.
منابع مشابه
Fuzzy clustering of time series data: A particle swarm optimization approach
With rapid development in information gathering technologies and access to large amounts of data, we always require methods for data analyzing and extracting useful information from large raw dataset and data mining is an important method for solving this problem. Clustering analysis as the most commonly used function of data mining, has attracted many researchers in computer science. Because o...
متن کاملAssessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories
In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...
متن کاملMissing data imputation in multivariable time series data
Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...
متن کاملAn Efficient Predictive Model for Probability of Genetic Diseases Transmission Using a Combined Model
In this article, a new combined approach of a decision tree and clustering is presented to predict the transmission of genetic diseases. In this article, the performance of these algorithms is compared for more accurate prediction of disease transmission under the same condition and based on a series of measures like the positive predictive value, negative predictive value, accuracy, sensitivit...
متن کاملAlgorithms for Segmenting Time Series
As with most computer science problems, representation of the data is the key to ecient and eective solutions. Piecewise linear representation has been used for the representation of the data. This representation has been used by various researchers to support clustering, classication, indexing and association rule mining of time series data. A variety of algorithms have been proposed to obtain...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000